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Abstract 

We study the long memory of order flow for each of three liquid cur¬ 
rency pairs on a large electronic trading platform in the foreign exchange 
(FX) spot market. Due to the extremely high levels of market activity 
on the platform, and in contrast to existing empirical studies of other 
markets, our data enables us to perform statistically stable estimation 
without needing to aggregate data from different trading days. We find 
strong evidence of long memory, with a Hurst exponent of H ~ 0.7, for 
each of the three currency pairs and on each trading day in our sample. 
We repeat our calculations using data that spans different trading days, 
and we hnd no signihcant differences in our results. We test and reject 
the hypothesis that the apparent long memory of order flow is an artifact 
caused by structural breaks, in favour of the alternative hypothesis of true 
long memory. We therefore conclude that the long memory of order flow 
in the FX spot market is a robust empirical property that persists across 
daily boundaries. 


1 Introduction 

The autocorrelation properties of financial time series have been the subject 
of fierce debate for more than 50 years dzi HOI Hi ss sni- Several important 
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properties of financial markets have been reported to exhibit autocorrelations 
that decay slowly, often over periods of days or even months uni ni [muni El 
154) . Such observations have prompted some authors to conjecture that some 
financial time series exhibit a phenomenon known as long memory IIEIIII]: 
which means that the decay of autocorrelation is sufficiently slow that the sum 
of terms in their autocorrelation function (ACF) diverges to infinity. 

In recent decades, the widespread uptake of electronic limit order books 
(LOBs) [30] in many hnancial markets has facilitated the recording of order-flow 
data, which provides a detailed description of traders’ actions and interactions 
at the microscopic scale. The availability of such data has ignited interest in the 
possibility that financial markets exhibit long memory at the level of individual 
order flow, and several empirical studies during the past decade have reported 
this to be the case in a wide variety of different markets [131 EH El ES] ■ 

In a recent publication, Axioglou and Skouras [2| challenged this view. They 
noted that, in order to construct sufficiently long time series to perform statis¬ 
tically stable estimation, existing studies of long memory in order flow have 
aggregated data from multiple trading days. Axioglou and Skouras argued that 
the apparent long memory reported by many studies is mostly an artifact caused 
by aggregating the data in this way. Specifically, they argued that the statisti¬ 
cal properties of order flow change each day, and that concatenating order-flow 
series from different trading days creates nonstationarities at the boundaries 
between daily series. Moreover, statistical tests are known to produce similar 
output for nonstationary series as they do for stationary series with long mem¬ 
ory [SI [23 El] • Therefore, distinguishing between these alternatives is a difficult 
task. 

Assessing whether or not order flow really is a long-memory process is im¬ 
portant for several reasons. From a practical perspective, the present values of a 
long-memory process are correlated with values in the distant future [H] , so iden¬ 
tifying and quantifying the strength of long memory is useful for forecasting. 
From a theoretical perspective, several recent publications suggest that long- 
range autocorrelations in order flow may hold the key to understanding the com¬ 
plex statistical properties of price formation in financial markets (e.g., price im¬ 
pact, volatility, and the heavy-tailed distribution of returns [IlElE51EHl[7n]). 
Moreover, if order flow really is a long-memory process, then identifying the 
sources of long-range autocorrelations may provide insight into traders’ strate¬ 
gic decision-making processes [iiiisn]. 

In this paper, we perform an empirical study of a new, high-quality data set 
from a large electronic trading platform in the foreign exchange (FX) spot mar¬ 
ket to assess the long-memory properties of order flow for three liquid currency 
pairs. Due to the extremely high levels of market activity on the platform, and 
in contrast to existing empirical studies of other markets, our data enables us 
to perform statistically stable estimates of the long-memory properties of intra¬ 
day order flow without needing to aggregate data from different trading days. 
We are therefore able to exclude the possibility that our results are influenced 
by nonstationarities at the boundaries between different trading days, and we 
thereby avoid Axioglou and Skouras’ [2] criticism of previous studies. For each 
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of the three currency pairs and on all trading days in our sample, we find strong, 
statistically significant evidence for long memory in order flow. 

To investigate how aggregating data from different trading days impacts our 
results, we also concatenate pairs of adjacent intra-day order-flow series to create 
cross-day series, which cross daily boundaries. We repeat all of our calculations 
on these cross-day series, and we find that our results are very similar to those 
for the intra-day series. We test and reject the hypothesis that the apparent long 
memory that we observe is an artifact caused by structural breaks, in favour of 
the alternative hypothesis of true long memory. We therefore conclude that the 
long memory of order flow in the FX spot market is a robust empirical property 
that persists across daily boundaries. 

Several important differences separate our work from previous studies of 
long memory in order flow. First, our data originates from the FX market, 
whereas previous studies have analyzed data from equities markets (anaiiii 
[^|5S]. The FX market is the largest market in the world, so understanding its 
statistical properties is an important task. Second, the microstructural trade¬ 
matching rules in the FX market differ from those in equities markets. We 
provide a detailed comparison of these trading mechanisms in Section S By 
examining how these differences impact traders’ actions, we are able to gain 
additional insight into the underlying causes of the long memory that we observe. 
Third, because of the extremely high levels of activity in our data, we are 
able to perform statistically stable estimation of the long-memory properties of 
intra-day order flow without needing to aggregate data from multiple trading 
days. We are therefore able to test and reject Axioglou and Skouras’ conjecture 
that the apparent long memory of order flow is mostly an artifact caused by 
aggregating data in this way. Fourth, we employ a wide range of estimators 
and statistical techniques to ensure that our quantitative assessment of long 
memory is robust. By contrast, some previous empirical studies have based 
their conclusions on single estimators, whose output can be misleading (as we 
demonstrate in Section 5.1). Together, these differences enable us to perform a 
detailed analysis of the strength, origins, and nature of long memory in order 
flow in a highly liquid but hitherto unexplored market. 

The paper proceeds as follows. In Sectionj^ we provide a detailed discussion 
of long memory in order flow. In Section we review the findings of several 
empirical studies of long memory in order flow. We describe our data in Section 
1^ We present our main results in Section and we discuss our findings in 
Section We conclude in Section In we present a technical overview of 
long memory. In we discuss the statistical techniques that we use to assess 
the long-memory properties of order flow. In we discuss the statistical tools 
that we use to distinguish between a long-memory series and a short-memory 
series with nonstationarities. 
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2 The Long Memory of Order Flow 

In this paper, we perform an empirical analysis of the long-memory properties 
of order flow. First, we recall the formal definition of long memory and provide 
a detailed discussion of the order-flow series that form the basis of our empirical 
study. 


2.1 Long Memory 

Let 

{Wt} = Wi,W2,... 

denote a real-valued, second-order stationarjQ time series with mean 

E(VFt) = 

autocovariance function 


7 (fc) = cov {Wt,Wt+\k\) 
and autocorrelation function (ACF) 

l{k) 


p(k) = 


7(0)’ 


The time series {Wt} is said to exhibit short memory if 


N 


lim V |p(fc)|<oo. 

7—Vrvo t ^ 


N—>-oo 


k^-N 


The time series {W^} is said to exhibit long memory if 


N 


lim V \p{k)\ = oo. 

N^oo ^^ 
k^-N 

For a technical introduction to long memory, see and [6] . 


( 1 ) 

( 2 ) 

(3) 

(4) 

(5) 

( 6 ) 


2.2 Limit Order Books 

More than half of the world’s financial markets use limit order books (LOBs) 
to facilitate trade m- In contrast to quote-driven systems, in which prices 
are set by designated market makers, trade in an LOB occurs via a continuous 
double-auction mechanism in which institutions submit orders. An order x = 
{Px,<-^x,tx) submitted at time tx with price Px and size ujx > 0 (respectively, 
ujx < 0) is a commitment by its owner to sell (respectively, buy) up to \ujx \ units 
of the asset at a price no less than (respectively, no greater than) px- 

time series {Wt} is second-order stationary if its first and second moments are finite 
and do not vary with time |16l I6,f | . 
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Whenever an institution submits a buy (respectively, sell) order x, an LOB’s 
trade-matching algorithm checks whether it is possible for x to match to an 
active sell (respectively, buy) order y such that Py < Px (respectively, Py > 
Px)- If so, the matching occurs immediately and the owners of the relevant 
orders agree to trade the specified amount at the specihed price. If not, then 
X becomes active, and it remains active until either it matches to an incoming 
sell (respectively, buy) order or it is cancelled. 

Orders that result in an immediate matching upon arrival are called market 
orders. Orders that do not — instead becoming active orders — are called limit 
ordersj^ The LOB C(t) is the set of all active orders for a given asset on a given 
platform at a given time t. For a detailed introduction to LOBs, see |30j . 

Many LOBs record comprehensive digital transcriptions of order flow on a 
given platform. These transcriptions provide an event-by-event account of the 
temporal evolution of C{t), and they thereby enable detailed empirical analysis 
of hnancial markets at the microscopic scale [T5] . 


2.3 Order-Sign Series 

Given a sequence of N consecutive arrivals of limit orders into C{t), the order- 
arrival series 

^XI ; ^X2 I ■ ■ ■ J ^Xn (7) 

is the time series of the arriving limit orders’ sizes. Similarly, given a sequence of 
M consecutive departures of active orders from C{t), the order-departure series 


j ) ^x'.^ ) ■ ■ • ) ‘•^x'j^ ( 8 ) 

is the time series of the departing active orders’ sizes. An entry in the order- 
arrival series always corresponds to the arrival of a new limit order, but an entry 
in the order-departure series can occur either because an active order is cancelled 
or because an incoming market order triggers a matching and thereby removes 
a limit order from the LOBTogether, the order-arrival and order-departure 
series completely determine the temporal evolution of C{t). 

When studying the long-memory properties of order-flow series, it is cus¬ 
tomary to study the time series of order signs. For a given order Xi of size tVxi , 
the order sign Li is given by 


j- _ / -1) if ‘^xi < 0, 

\ +1, ifw,. >0. 

Recall from Section |2.2| that an order has negative size if and only if it is a buy 
order. Therefore, an order-sign series is simply a time series of ±ls, where —1 
entries correspond to buy-order activity and -1-1 entries correspond to sell-order 
activity. 

^Some platforms allow other order types (such as fill-or-kill, stop-loss, or peg orders Ell), 
but it is always possible to decompose the resulting order flow into limit and/or market orders. 
Therefore, we study LOBs in terms of these simple building blocks. 

^Incoming market orders are not reported in the order-arrival series. 
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The reason for studying time series of order signs — instead of the corre¬ 
sponding time series of order sizes — is that empirical studies of a wide variety 
of different markets have reported that order sizes often vary over several or¬ 
ders of magnitude (see |3D] for a recent survey of empirical studies of LOBs). 
This brings into question the convergence properties of higher-order moments 
of time series of order sizes. By contrast, studying only the time series of order 
signs guarantees that all moments exist, while still providing insight into the 
long-range autocorrelation properties of buy and sell activity in order flow. 


3 Literature Review 


Early studies of the autocorrelation properties of order flow tended to focus 
on short-range (i.e., small-lag) autocorrelations in order-sign series. Hasbrouck 
[33] studied the order-sign series for trades on the New York Stock Exchange 
(NYSE) during March-April 1985. He reported that lag-1 autocorrelations were 
strongly positive and that the mean sample ACE (see B.l) across all stocks in 
the sample was positive up to lags of at least 200. Biais et al. H] studied 
order-sign series for market orders, limit orders, and cancellations for 40 stocks 
traded on the Paris Bourse in 1991. For each type of order flow, they reported 
that any given event type (e.g., buy market order) was likely to be followed by 
another event of the same type. Ellul et al. [53] and Yeo m both reported 
similar findings for activity on the NYSE during 2001. 

More recent work has focused on the long-memory properties of order-sign 
series. Lillo and Farmer [41] studied order-sign series for limit order arrivals, 
market order arrivals, and cancellations for 20 stocks on the London Stock 
Exchange (LSE) during 1999-2002. They used a wide variety of statistical 
techniques and estimators to test and reject the hypothesis that these series were 
short-memory series, in favour of the alternative hypothesis of long memory. 
They also estimated the Hurst exponent (see |A.3| ) for each series and reported a 
mean value of iL « 0.7. The cross-sectional variation in H across the stocks that 
they studied was small but significant. Because their sample ACEs contained 
no significant peaks or breaks corresponding to the length of a single trading 
day, Lillo and Farmer argued that long memory in order flow persists across 
daily boundaries. They also repeated their experiments on similar data from 
the NYSE and found similar results. 

Bouchaud et al. [13] studied long-range autocorrelations in the order-sign 
series for market orders on Euronext in 2001-2002. For all of the stocks that they 
studied, they reported that the sample ACEs decayed approximately according 
to a power law. They estimated each stock’s power-law exponent directly from 
its sample ACE and reported values that correspond to Hurst exponents (see 
Equation (dU)) ranging from H « 0.65 to H m 0.9. Similarly to Lillo and 
Farmer, Bouchaud et al. argued that long-range autocorrelations in order flow 
persist across daily boundaries. 

Mike and Farmer [55] studied order-sign series for both market orders and 
limit orders for 25 stocks traded on the LSE during 2000-2002. They used 
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detrended fluctuation analysis (DFA) (see B.4| to estimate the Hurst exponent 
for each stock and reported values ranging from H « 0.75 to « 0.88, with a 
mean of H « 0.83 across all stocks. 

To date, two mechanisms have been proposed to explain the slow decay of 
autocorrelations in order-flow series. The first is that traders display “herding” 
behaviour, either because they all respond similarly to common information 
or because they monitor each other’s actions and update their strategies by 
imitating those of their most successful competitors [40]. The second is that 
traders who wish to perform large trades decompose them into smaller chunks, 
which they submit over several days (or even months) to minimize their market 
impact nidi HD. This strategy is commonly known as order splitting. 

Gerig m assessed the plausibility of these two explanations by studying 
order-flow series from the LSE. In contrast to most LOB data sets, Gerig’s 
data included information about the broker that submitted each order. This 
enabled him to compare the autocorrelation properties of order flow generated 
by individual brokers to those of the aggregate order flow generated by all 
brokers. He reported that correlations across different brokers decayed quickly 
to 0, but that autocorrelations in order flow from individual brokers exhibited 
long memory. He therefore argued that order splitting is a much more plausible 
explanation for long memory in order flow than is herding. Toth et al. ISZlISSj 
also studied data containing brokerage identifiers from the LSE, and reached a 
similar conclusion. 

Recently, however, Axioglou and Skouras [5] challenged the notion that order 
flow exhibits long memory by arguing that this apparent effect was mainly an 
artifact caused by nonstationarities in the underlying order-flow series. Specif¬ 
ically, they noted that to construct sufficiently long time series to perform sta¬ 
tistically stable estimation of long-range autocorrelations, existing studies have 
aggregated order-flow data from different trading days. They argued that such 
aggregation produces order-flow series with structural breaks at the daily bound¬ 
aries. Because many statistical tests produce similar output for nonstationary 
series as they do for stationary series with long memory (see Axioglou and 
Skouras conjectured the apparent long memory in order flow is mainly an arti¬ 
fact caused by these structural breaks. 

To test this hypothesis, Axioglou and Skouras studied the order-sign series 
for market orders on the LSE during 2005-2006. They first aggregated data 
from several different trading days, and they noted that standard statistical 
tests applied to this data concluded strongly in favour of long memory. They 
then constructed shorter time series by aggregating data across pairs of consec¬ 
utive trading days. They applied the cumulative-sum change-point estimator 
(see Equation (391) to these series, and they were able to detect the daily bound¬ 
aries with high accuracy. They then applied Berkes’ change-point test (see 
to test the hypothesis that the apparent long memory in the cross-day series 
was actually due to a structural break. Working at the 5% significance level, 
they could not reject the null hypothesis of a piecewise stationary series with a 
structural break in favour of the alternative hypothesis of true long memory in 
about two thirds of the order-flow series that they studied. They concluded that 
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although order flow exhibited significant autocorrelations within a single trad¬ 
ing day, the strength of autocorrelations that persisted across daily boundaries 
was very weak. 


4 Data 


We have been granted access to a recent, high-quality data set from Hotspot 
FX [38l |37j , which is one of the largest multi-institution trading platforms in 
the FX spot market. According to the 2010 Triennial Central Bank Survey |1], 
the mean daily turnover of the global FX market was approximately 4.0 tril¬ 
lion US dollars. Approximately 37% of this volume was due to spot trades, of 
which approximately 40% was conducted electronically. In total, the mean daily 
volume traded on all multi-institution electronic trading platforms was approx¬ 
imately US 0.6 trillion USD [4]. The mean daily volume traded on Hotspot FX 
during the same period was approximately US 21.5 billion USD [5S]. There¬ 
fore, trade on Hotspot FX accounted for approximately 4% of all volume traded 
electronically in the FX spot market during this period. 

Hotspot FX offers trade for more than 60 different currency pairs. A price for 
the currency pair XXX/YYY denotes how many units of the counter currency 
YYY are exchanged per unit of the base currency XXX. Trade for each currency 
pair occurs in a separate LOB with price-time priority. 

The Hotspot FX platform serves a broad range of trading professionals — 
including banks, financial institutions, hedge funds, high-frequency traders, cor¬ 
porations, and commodity trading advisers [38] . As is customary on multi¬ 
institution trading platforms in the FX spot market. Hotspot FX enables insti¬ 
tutions to specify credit limits for their trading counterparties. Each institution 
can only access the trading opportunities offered by counterparties with whom 
they possess sufficient bilateral credit. We call this market organization a quasi- 
centralized limit order hook (QCLOB), because different institutions have access 
to different subsets of a centralized liquidity pool. For a detailed discussion of 
QCLOBs, see [53] • Examples of platforms that utilize QCLOBs include Reuters 
[5S] . EBS [12], and Hotspot FX |3H|. For more details about trade on Hotspot 
FX, see |S7|. 

The data that we study describes the full order-arrival and order-departure 


series (see Section 2.31 for the EUR/USD (Euro/US dollar), GBP/USD (Pounds 


sterling/US dollar), and EUR/GBP (Euro/Pounds sterling) currency pairs dur¬ 
ing the peak trading hours of 08:00:00-17:00:00 GMT on 30 trading days during 
May-June 2010. Although trade in the FX spot market continues to operate 
outside of these hours, more than 70% of the total traded volume for each of 
the three currency pairs occurs during these 9 hours each day. 

For a given trading day we use the Hotspot FX data (see Section]^ to 
produce an ordered list of the limit order arrivals that occur during the peak 
trading hours of 08:00:00-17:00:00 GMT. We then use Equation ([^ to deduce 
the intra-day arrival sign series from this list. Similarly, we use the Hotspot 
FX data to produce an ordered list of the active order departures that occur 



during the same period, and we then use Equation ([^ to deduce the intra-day 
departure sign series from this list. We repeat this process for each of the 30 
trading days Di, D 2 , ■ ■ ■, D 30 in our sample. 

In Table we list the minimum, maximum, and mean number of limit order 
arrivals and departures across these 30 trading days. The number of arriving 
limit orders is largest for EUR/USD and smallest for EUR/GBP. For each of the 
three currency pairs and on most trading days that we study, the total number 
of limit order arrivals slightly exceeds the total number of departures, which 
implies that active orders accumulate throughout the trading day. 




EUR/USD 

GBP/USD 

EUR/GBP 

Number of Arrivals 

Minimum 

Maximum 

Mean 

3455561 

6003406 

4533550.8 

2962688 

5296372 

4340345.4 

2019826 

3623053 

2932726.8 

Number of Departures 

Minimum 

Maximum 

Mean 

3449793 

5992343 

4524175.4 

2961217 

5293082 

4337320.6 

2019672 

3622559 

2932171.5 


Table 1: Minimum, maximum, and mean number of limit order (top 
panel) arrivals and (bottom panel) departures for EUR/USD, GBP/USD, and 
EUR/GBP, measured across the 30 trading days in our sample. 


To quantify the imbalance between buying and selling activity, we also calcu¬ 
late the percentage of sell orders (i.e., percentage of -1-1 entries in the order-sign 
series) among limit order arrivals and departures each day (see Table [^. For 
each of the three currency pairs and for both arrivals and departures, sell orders 
account for close to 50% of order flow. Therefore, the level of buying activity is 
approximately equal to the level of selling activity on all trading days that we 
study. 




EUR/USD 

GBP/USD 

EUR/GBP 

Percentage of Arriving Sell Orders 

Minimum 

Maximum 

Mean 

48.664% 

50.870% 

50.008% 

49.389% 

51.255% 

50.135% 

48.817% 

50.252% 

49.894% 

Percentage of Departing Sell Orders 

Minimum 

Maximum 

Mean 

48.667% 

50.879% 

50.009% 

49.389% 

51.257% 

50.134% 

48.815% 

50.251% 

49.895% 


Table 2: Minimum, maximum, and mean percentages of sell orders among 
limit order (top panel) arrivals and (bottom panel) departures for EUR/USD, 
GBP/USD, and EUR/GBP across the 30 trading days in our sample. 
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5 Results 

In this section, we present our empirical results for the arrival-sign series. The 
corresponding results for the departure-sign series are qualitatively similar. 


5.1 Results for Intra-Day Series 


In Figure we plot the sample ACFs (see B.ll for each of the three cur¬ 
rency pairs’ intra-day arrival-sign series on 4 May 2010. The results for all 
other intra-day series on each day in our sample are qualitatively similar. Up 
to lags of about 25 events, the sample ACFs fluctuate between positive and 
negative values, which indicates that the order-flow series contain short-range 
negative autocorrelations. Although these autocorrelations have a magnitude 
below about 0.1 and are therefore relatively weak, this effect is present on each 
day in our sample, and we therefore deem it to be a robust statistical property 
of the data. We return to our discussion of these negative autocorrelations in 
Section 0 

In Figure we plot the intra-day sample ACFs in doubly logarithmic co¬ 
ordinates. To help reduce the noise at higher lagsj^we plot the mean sample 
ACFs, which we obtain by averaging the daily sample ACFs across all 30 days 
in our sample. After the short-term negative autocorrelations subside (which 
occurs before lag 50, the lower bound in our plots), the sample ACFs remain 
positive for lags of several thousands of events. This suggests that there are 
long-range, positive autocorrelations in the series. 

In Figure]^ we show rescaled-range plots (see B.2) for each of the three 
currency pairs’ intra-day arrival-sign series. For each of the three currency 
pairs, the slope of the rescaled-range plot for each intra-day order-flow series 
is close to 0.5 for values of k below about 10000. For larger values of k, the 
slope of each rescaled-range plot is above 0.5. This suggests that the intra-day 
order-flow sign series are long-memory processes. 

To test the hypothesis of long memory more formally, we perform Lo’s mod¬ 
ified rescaled-range test (see B.3) on the intra-day arrival series (see Figure]^. 
For each day in our sample, Lo’s modified rescaled-range test causes us to reject 
the null hypothesis of short memory at the 5% significance level at all bandwidth 
choices that we study. Similarly, when using Andrews’ plug-in estimator (see 
Equation (29)) to estimate a suitable choice of bandwidth parameter from the 
data, Lo’s test rejects the null hypothesis of short memory for each of the three 
currency pairs and on all 30 days in our sample. Therefore, Lo’s test provides 
strong evidence that the intra-day order-sign series are long-memory processes. 

Given the strong results of Lo’s test, we now turn to assessing the strength of 
the long memory in intra-day order flow. To do so, we use two different methods 


to estimate the Hurst exponent H (see A.31: DFA (see B.41 and log-periodogram 


regression (see B.5). Due to the negative short-range autocorrelations that we 


^The statistical errors associated with estimating the sample ACF are approximately con¬ 
stant at all lags, but the signal strength is smaller at larger lags because there are fewer 


independent data points m- 
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Figure 1: Sample ACFs (see |B.1[ ) for (top row) EUR/USD, (middle row) 
GBP/USD, and (bottom row) EUR/GBP intra-day arrival-sign series, which 
we construct using Equation (|^. Each plot shows the sample AGE for 4 May 
2010. The corresponding results for all other days in our sample and for the 
intra-day departure-sign series are qualitatively similar. 
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Figure 2: Doubly logarithmic plots of mean sample ACFs (see B.l) for (top row) 
EUR/USD, (middle row) GBP/USD, and (bottom row) EUR/GBP intra-day 
arrival-sign series. We obtain each plot by averaging the daily sample AGFs 
across all 30 days in our sample. We omit lags smaller than 50 events because 
some values of the mean sample ACFs are negative in this range. The results 
for the intra-day departure-sign series are qualitatively similar. 
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Figure 3: Rescaled-range plots (see B.2) for (solid green curves) EUR/USD, 
(dashed orange curves) GBP/USD, and (dotted-dashed purple curves) 
EUR/GBP arrival-sign series. The pale curves indicate the rescaled-range statis¬ 
tics R{k) (see Equation (24)) for a single intra-day series, and the darker curves 
indicate the mean across all 30 intra-day series. The dotted black line has a 
slope of 0.5. In these plots, we divide each intra-day series into B = 100 blocks; 
we also produced similar plots for several different values of R S [10,1000] and 
obtained similar results. The results for the intra-day departure-sign series are 
qualitatively similar. 
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Figure 4: Box plots of Lo’s modified rescaled-range test statistic V{q) (see |B.3| ) 
for the given choices of bandwidth parameter q for the (top) EUR/USD, (mid¬ 
dle) GBP/USD, and (bottom) EUR/GBP intra-day arrival-sign series. For each 
choice of q, the boxes indicate the lower quartile, median, and upper quartile 
of V{q) and the whiskers indicate the minimum and maximum of V{q), across 
all 30 intra-day series. The light grey shading indicates the critical region for 
Lo’s modified rescaled-range test at the 5% significance level. The results for 
the intra-day departure-sign series are qualitatively similar. 
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observe in the sample ACFs (see Figure [^, it is necessary to identify sensible 
choices of input parameters — namely, the minimal window length mmin of a 
DFA and the number c of Fourier frequencies in a log-periodogram regression 
— when performing these estimation techniques. 

To identify a suitable choice of TOmin for our DFA estimates of H, we hrst 
plot the length-m mean detrended standard deviation F{m) for several choices 
of m (see Figure]^. For window lengths m that are smaller than about 25, 
the negative autocorrelations dominate the mean detrended standard deviations 
F{m). These values of m are therefore unsuitable for calculating a DFA estimate 
of FI. For values of m larger than about 100, the log-log plots of F{m) follow 
an approximately straight line. We therefore perform our DFA estimates of H 
using mniin = 100. 

o 



Figure 5: Length-m mean detrended standard deviation F(m) (see |B.4[ ) for 
the (solid green curves) EUR/USD, (dashed orange curves) GBP/USD, and 
(dotted-dashed purple curves) EUR/GBP arrival-sign series. Each curve cor¬ 
responds to a single intra-day series. The dotted black line indicates m = 100. 
The results for the intra-day departure-sign series are qualitatively similar. 

To identify a suitable choice of c for our log-periodogram regression estimates 
of H, we plot the log-periodogram regression estimates of H for several different 
values of c (see Figure]^. In all cases, the estimates of F[ tend to decrease as c 
increases, and there is no clear plateau over which the estimates of H are stable. 
In the absence of an obvious choice for c, we use the popular rule-of-thumb [26] 
c = '/N, where N is the length of the given series. We stress, however, that 
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the plots in Figure [^indicate that our log-periodogram regression estimates of 
H depend heavily on this choice, so using a different choice for c would produce 
quantitatively different results. For example, another popular rule-of-thumb [62] 
is c = 0.1 X {N/2). Due to the extremely large size of the intra-day order-flow 
series, this choice produces estimates of Ff « 0.5, which we do not regard to be 
sensible given that the other statistical tests all suggest that the intra-day order- 
flow series exhibit long memory. The absence of a clear choice for c highlights 
a weakness of log-periodogram regression for the present application. 
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Figure 6: Log-periodogram regression (see B.51 estimates of H for given number 
c of Fourier frequencies for the (solid green curves) EUR/USD, (dashed orange 
curves) GBP/USD, and (dotted-dashed purple curves) EUR/GBP arrival-sign 
series. Each curve corresponds to a single intra-day series. The results for the 
intra-day departure-sign series are qualitatively similar. 


In Figure we plot the DFA and log-periodogram regression estimates of H 
(using our choices of = 100 and c = '/N) for each intra-day arrival-sign 

series. For each of the three currency pairs, the DFA estimates of H cluster in 
the range of about 0.6 to about 0.8. The log-periodogram regression estimates 
of H tend to be slightly larger (they cluster in the range of about 0.65 to about 
0.85). However, the latter results depend heavily on the choice of c (see Figure 
[^, so we deem the DFA estimates to be more useful. 

In Table we list the means and standard deviations of our DFA and log- 
periodogram regression estimates of H across all 30 days in our sample. As we 
noted above, our results for arrival-sign series and departure-sign series are very 
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Figure 7: DFA and log-periodogram regression estimates of the Hurst exponent 
H for the intra-day (green squares) EUR/USD, (orange circles) GBP/USD, and 
(purple triangles) EUR/GBP arrival-sign series. Each point corresponds to the 
estimates for a single trading day. The dotted black line indicates the diagonal. 
The results for the intra-day departure-sign series are qualitatively similar. 
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similar. In all cases, the mean estimates of H are many standard deviations 
larger than 0.5, which strongly supports the hypothesis that these series exhibit 
long memory. Based on our results in Figures[^[^ andj^and Table[^ we regard 
H « 0.7 to be a good estimate for the Hurst exponent of the arrival-sign series 
and departure-sign series for each of the three currency pairs. 




DFA 

Log-Periodogram Regression 

EUR/USD 

Arrivals 

Departures 

0.70 (0.04) 
0.70 (0.04) 

0.74 (0.03) 

0.74 (0.03) 

GBP/USD 

Arrivals 

Departures 

0.72 (0.04) 
0.72 (0.03) 

0.74 (0.03) 

0.74 (0.03) 

EUR/GBP 

Arrivals 

Departures 

0.72 (0.02) 
0.71 (0.02) 

0.79 (0.04) 

0.78 (0.04) 


Table 3: DFA and log-periodogram regression estimates of the Hurst exponent 
H for the EUR/USD, GBP/USD, and EUR/GBP intra-day arrival-sign and 
departure-sign series. Each entry indicates the mean of the estimates across all 
intra-day series. The numbers in parentheses indicate 1 standard deviation of 
the estimates across all intra-day series. 


5.2 Results for Cross-Day Series 

To assess how aggregating data from different trading days impacts our results, 
we now repeat all of our calculations using order-flow series that span daily 
boundaries. Specifically, for a pair of consecutive trading days Di and 7?i+i, 
we construct the cross-day arrival-sign series by concatenating the second halQ 
of the intra-day arrival-sign series from day Di and the first half of the corre¬ 
sponding intra-day arrival-sign series from day Di+i. We construct the cross-day 
departure-sign series similarly using the corresponding departure-sign series. 

In all cases, we find that the sample AGFs, rescaled-range plots, and results 
from Lo’s modified rescaled-range test are qualitatively similar to those for the 
intra-day series (see Figures and|^. This provides strong evidence that 

the cross-day order-flow series exhibit long memory. To quantify the strength of 
this long memory, we calculate DFA and log-periodogram regression estimates of 
the Hurst exponent H using the same choices of input parameters (mmin = 100 
and c = y/N) as we used for our corresponding estimates of H for the intra¬ 
day series. We plot our results in Figure and list the means and standard 
deviations of our estimates across all 29 cross-day periods in Table 

Our estimates of H for the cross-day series are very similar to the corre¬ 
sponding estimates for the intra-day series (see Figure]^ and Table [^. As with 
the intra-day series, the DFA estimates of H cluster in the range of about 0.6 
to about 0.8 and the log-periodogram regression estimates of H cluster in the 
range of about 0.65 to about 0.85. We therefore regard H « 0.7 to be a good 

®If an intra-day series has odd length, we round down to the previous integer. 
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Figure 8: DFA and log-periodogram regression estimates of the Hurst exponent 
H for the cross-day (green squares) EUR/USD, (orange circles) GBP/USD, and 
(purple triangles) EUR/GBP arrival-sign series. Each point corresponds to the 
estimates for a consecutive pair of trading days. The dotted black line indicates 
the diagonal. The results for the cross-day departure-sign series are qualitatively 
similar. 




DFA 

Log-Periodogram Regression 

EUR/USD 

Arrivals 

Departures 

0.71 (0.03) 
0.70 (0.03) 

0.74 (0.03) 

0.74 (0.03) 

GBP/USD 

Arrivals 

Departures 

0.73 (0.03) 
0.73 (0.03) 

0.74 (0.03) 

0.74 (0.03) 

EUR/GBP 

Arrivals 

Departures 

0.72 (0.03) 
0.72 (0.03) 

0.79 (0.04) 

0.78 (0.04) 


Table 4: DFA and log-periodogram regression estimates of the Hurst exponent 
H for the EUR/USD, GBP/USD, and EUR/GBP cross-day arrival-sign and 
departure-sign series. Each entry indicates the mean of the estimates across all 
cross-day series. The numbers in parentheses indicate 1 standard deviation of 
the estimates across all cross-day series. 
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estimate for the Hurst exponent of the cross-day order-sign series for each of 
the three currency pairs. 


5.3 Long Memory versus Nonstationarity 

In this section, we address the conjecture by Axioglou and Skouras [2] that the 
apparent long memory in the cross-day series is mostly an artifact caused by 
structural breaks at the boundaries between different trading days (see Section 

§■ 

To assess whether the estimator is able to identify the boundary between 
different trading days, we calculate the normalized cumulative-sum change-point 


estimate (see Equation (39)) for each cross-day order flow series . Because the 
lengths of the intra-day series vary with the number of arrivals and departures 
each day, the locations of the boundaries between different trading days vary 
across different cross-day series. Consequently, the boundary between trading 
days Di and does not necessarily lie at the mid point in the relevant 

cross-day series. We therefore introduce a normalization to enable comparisons 
between different cross-day series. Given a cross-day series of length N with 
daily boundary r* G {2, 3,..., iV — 1}, and given a change-point estimator f* 
for r*, the normalized change-point estimator is 


{r*-r*)/r*, 

ir*-r*)/ir-r*), 


if r* < r*, 
otherwise. 


( 10 ) 


Observe that f G [—1,1], and f = 0 if and only if f* = r*. 

In Figure]^ we plot the empirical cumulative density functions (ECDFs) 
of r for each of the three currency pairs’ cross-day arrival-sign series. The 
results for the cross-day departure-sign series are qualitatively similar. For 
the EUR/GBP cross-day series, there is no visible jump in the EGDF near 
f = 0. Therefore, the cumulative-sum change-point estimator performs very 
poorly at detecting the true locations of the daily boundaries in the cross-day 
series for this currency pair. For EUR/USD and GBP/USD, there is a small 
but discernible spike in the ECDFs near to f = 0, which indicates that the 
estimator performs somewhat better for these currency pairs. However, the 
distribution of the estimator’s output across different trading days is still very 
broad, which suggests that its performance is still rather weak. Together, these 
results suggest that the apparent long-memory properties of the cross-day series 
are not strongly influenced by nonstationarities at the daily boundaries. 

To test the hypothesis of nonstationarity more formally, we also perform 
Berkes’ change-point test (see on both the intra-day and cross-day order- 
flow series. We repeat the test for each of 0, 1, and 2 change points. We show 


box plots of our results in Figure 10 


For each of the three currency pairs and for each of 0, I, and 2 change-points, 
the test rejects the null hypothesis at the 1% significance level in all intra-day 
and cross-day arrival-sign and departure-sign series. The results for the intra¬ 
day series are again very similar to those for the cross-day series. This result 
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Figure 9: ECDFs of the normalized cumulative-sum change-point estimator f 
(see Equation (101 for the (solid green curve) EUR/USD, (dashed orange curve) 
GBP/USD, and (dotted-dashed purple curve) EUR/GBP cross-day arrival-sign 
series. The results for the cross-day departure-sign series are qualitatively sim¬ 
ilar. The dotted black curve illustrates the estimator’s null distribution for a 
second-order stationary series with no structural breaks, which we estimate by 
calculating the ECDF of the normalized cumulative-sum change-point estimates 
from 100000 independent series that each consist of 1000000 random variables 
drawn independently and at random from the standard normal distribution. 
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Figure 10: Box plots of Berkes’ change-point test statistic M (see E with (top 
row) 0 change points, (middle row) 1 change point, and (bottom row) 2 change 
points, for EUR/USD, GBP/USD, and EUR/GBP arrival-sign series. The left 
plots show the results for the intra-day series, and the right plots show the 
results for the cross-day series. The boxes indicate the lower quartile, median, 
and upper quartile of M, and the whiskers indicate the minimum and maximum 
of M, across all intra-day or cross-day series. In each case, we use Andrews’ 
[T] data-driven plug-in estimator q from Equation (29) to calculate M. The 
light grey shading indicates the critical region for Berkes’ change-point test at 
the 1% significance level. The results for the cross-day departure-sign series are 
qualitatively similar. 
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provides strong evidence against the hypothesis that the apparent long-memory 
properties of the series are actually artifacts caused by nonstationarities. We 
therefore strongly reject Axioglou and Skouras’ [2] hypothesis that the apparent 
long memory in the cross-day series is mostly an artifact caused by structural 
breaks at the daily boundaries, in favour of the alternative hypothesis of true 
long memory that persists across daily boundaries. We return to this discussion 
in Section [6l 

6 Discussion 

The results of our statistical tests strongly support the hypothesis that both the 
arrival-sign and departure-sign series on Hotspot FX exhibit long memory that 
persists over several thousands of events. For each of the three currency pairs, 
our calculations suggest that this long memory in order flow can be characterized 
by a Hurst exponent of i? « 0.7. The variation in this result — both across the 
different currency pairs and across the different days in our sample — is small. 

Our results raise several interesting points for discussion. As we noted in 
Section several authors have argued that order splitting provides a more 
plausible explanation for long memory in order flow than does herding. Our 
results provide further evidence to support this argument. Other empirical 
studies of long memory in order flow have studied LOBs in which all institutions 
observe all order flow from all others. On Hotspot FX, by contrast, institutions 
can only see the order flow that originates from institutions with whom they 
possess bilateral credit (see Section |^ . It is therefore reasonable to assume 
that herding effects are much weaker on Hotspot FX than they are on other 
platforms. Despite this important difference, our estimates of H on Hotspot 
FX are very similar to those reported for other platforms (see Section]^. We 
therefore argue that herding plays a minor role in the long-memory properties 
of order flow and that long-range autocorrelations are instead caused by order¬ 
splitting strategies (which are unaffected by the quasi-centralized nature of trade 
on Hotspot FX). 

We turn next to the low-order negative autocorrelation that we observe in the 
sample ACFs (see Figure]^. Although this negative autocorrelation is relatively 
weak, the effect is present for each day in our sample, and we therefore deem 
it to be a robust statistical property of the data. Interestingly, several other 
studies of long memory in order flow have not reported this behaviour and have 
instead reported positive autocorrelations at small lags. This raises the question 
of why our results differ. We believe that the answer to this question lies in the 
extremely high activity levels in the FX spot market. In a study of order flow 
on the LSE, Toth et al. m used broker identiher codes to trace which order 
flow originated from which brokerage. For limit order arrivals, they reported 
that orders originating from the same broker were positively correlated with 
each other, but that orders originating from different brokers were negatively 
correlated with each other. Because of the extremely large number of partici¬ 
pants in the FX spot market, an arriving limit order is likely to be followed by 
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many other arriving limit orders from other participants. According to Toth 
et al.'s findings, this should generate short-range, negative autocorrelations in 
order flow. At larger lags, however, the strength of this effect diminishes and 
the long-range autocorrelation effects become more apparent. We conjecture 
that the short-range negative autocorrelations were not reported in some other 
studies because this effect is weaker (and, therefore, not observable) in markets 
with lower activity levels. 

It is also interesting to consider why the cumulative-sum change-point esti¬ 
mator (see Equation |39[ ) performs much worse at detecting the daily boundaries 
between intra-day order-sign series in the FX spot market than was reported by 
Axioglou and Skouras for the LSE [5]. We conjecture that the answer lies in the 
structural differences between the trading days in these two markets. Trading 
on the LSE commences at 08:00:00 and ceases at 16:30:00 each day [65]. By 
contrast, trading in the FX spot market occurs 24 hours a day. Although we 
restrict our attention to the peak trading hours of 08:00:00-17:00:00 GMT, the 
absence of a market-wide closing time has several important consequences for 
the way that traders act. 

First, many financial institutions require that traders unwind their positions 
(i.e., rebalance their net daily holdings to 0) before the end of each trading day 
[15] . The LSE market closure at 16:30:00 constitutes a hard deadline by which 
any traders who seek to unwind their positions must fulfill this goal, even if 
doing so requires them to trade at unfavourable prices. This may cause the 
statistical properties of order flow late in the trading day to differ substantially 
from those early in the trading day, and may therefore result in a structural 
break at the daily boundaries when concatenating data from different days. In 
the FX spot market, by contrast, there is no market-wide closing time. 

Second, the absence of market opening and closing times in the FX spot mar¬ 
ket enables traders in different time zones to begin and end their trading days 
at different times. Hsieh and Kleidon |35j noted that many traders spend the 
early part of their trading day assessing the state of the market, the middle part 
of their trading day performing the majority of their trades, and the late part 
of their trading day resetting their net inventory to 0. In markets with specified 
opening and closing times (such as the LSE), all traders progress through this 
cycle simultaneously. In the FX spot market, by contrast, traders in different 
time zones can choose the length and timing of their trading days as they wish. 
The flux of traders from different time zones into and out of the FX spot market 
may cause the statistical properties of order flow to differ from those in markets 
where all traders’ trading days are aligned by the market opening and closing 
times. 

Third, Axioglou and Skouras |2] conjectured that most traders on the LSE 
reassess their trading strategy once per day, while markets are closed, then 
implement their chosen strategy throughout the next trading day. This is not a 
plausible description of the actions of traders in the FX spot market, which is 
always open. 

Finally, we return to the question of whether the apparent long memory that 
we observe is actually an artifact caused by structural breaks. Although we test 
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and strongly reject this hypothesis for zero, one, and two structural breaks, our 
results do not rule out the possibility of larger numbers of structural breaks. In¬ 
deed, it is plausible that intra-day order-sign series contain very large numbers 
of structural breaks, for reasons such as the arrivals or departures of institutions 
in the market or the release of macroeconomic news. From a modelling perspec¬ 
tive, however, models that rely on large numbers of structural breaks that occur 
at unknown points in time have several important drawbacks, such as being dif¬ 
ficult to estimate and of little use for forecasting (see[^. Moreover, due to their 
extremely large numbers of parameters, such models suffer a considerable risk 
of overfitting. By contrast, long-memory models are parsimonious, easy to sim¬ 
ulate, and useful for forecasting. We therefore regard long-memory approaches 
to be more useful than alternative approaches that rely on fitting large numbers 
of structural breaks to explain the observed autocorrelation properties of the 
order-sign series. 

It is important to note that our results do not rule out the possibility that 
order-sign series contain both long memory and structural breaks. Indeed, the 
co-existence of these effects could help to explain the differences between the 
behaviour observed in different markets. According to our findings, intra-day 
order flow in the FX spot market exhibits long memory. Any possible struc¬ 
tural breaks appear to have little impact. In equities markets, by contrast, the 
relatively low number of order arrivals each day makes precise empirical assess¬ 
ment of long memory much more difficult, and structural breaks appear to have 
a greater impact on the apparent long-memory properties of order flow. It is 
plausible that both structural breaks and true long memory could coexist in all 
financial markets and that the relative importance of these effects is different 
for different assets. 


7 Conclusions 

In this paper, we have investigated the long memory of order flow in the FX 
spot market. Due to the extremely high levels of activity on the platform that 
we study, and in contrast to other empirical studies on this topic, we were able 
to investigate the long-memory properties of intra-day series without needing 
to aggregate data from different trading days. For each of the three currency 
pairs and on each of the trading days that we studied, we found that both 
arrival-sign and departure-series exhibit long memory, with a Hurst exponent of 
H « 0.7. We also uncovered a negative autocorrelation at shorter lags, which 
we conjectured was caused by the large number of participants in the FX spot 
market. 

All of our results for data that crosses daily boundaries were similar to those 
for intra-day data, and we strongly rejected the hypothesis that the apparent 
long memory of order flow is an artifact caused by structural breaks. We there¬ 
fore concluded that long memory is a robust statistical property of order flow 
on Hotspot FX that persists across daily boundaries. We also proposed several 
possible reasons why our findings differ from those reported by Axioglou and 
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Skouras for the LSE [2]. Further empirical study of data from other markets 
will help to illuminate these issues further, and is therefore an important topic 
for future research. 

Finally, we note that the existence of long memory in order flow raises an 
interesting question called the “efficiency paradox” |24| : how can return series 
remain unpredictable given that order flow exhibits long memory? To date, 
there are two main hypotheses. Some authors naE] have argued that markets 
reside at a “self-organized critical point” in which liquidity takers cause long- 
range autocorrelations in order flow that exactly balance the long-range negative 
autocorrelations caused by liquidity providers. Others min] have argued that 
predictability in order flow is offset by a negative correlation with available 
liquidity. At present, there is no clear consensus as to which approach best 
describes the temporal evolution of real markets, and further empirical and 
theoretical study of this question remains an important and exciting avenue for 
future research. 


A Long Memory 

In this appendix, we provide a detailed description of long-memory processes. 
For further discussion of these topics, see [5] . 

A.l Autocorrelation and Long Memory 

Recall from Section |2.1| that a second-order stationary time series {Wt\ = 
Wi,W 2 , ■. ■ with ACF p{k) (see Equation Q) is said to exhibit long memory if 

N 

lim ^ \p{k)\ = oo. (11) 

k=-N 

One way in which a time series can exhibit long memory is if there exists 
some constant a G (0,1) such that p{k) decays asymptotically as a power of fc: 

p{k) ~ k~°'L{k), k —>■ oo, (12) 

where L is a slowly varying functior0 |17llTTlH^ . Smaller values of a correspond 
to slower decay of the long-range autocorrelations in {Wt} [T^ HT] . 

A.2 The Rescaled-Range Statistic 

Let 

j=t+i 

®A function L is slowly varying if lims._^oo L{zk)/L{k) = 1 for all z > 0. 
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The rescaled-range statistic |51| is the ratio 


Q{t,k) 


R{t, k) 
S(t, k) ’ 


where, for t,k G Z>o and for i G {1, 2,..., fc}, 


(14) 


R(t. k) = max 

l<i<k 


t-\-i 


t-\-i 

E iWo-Wk) 

— min 

l<i<k 

E 

J=t+1 


J=t+1 


(15) 


and 


S\t,k) 


t+k 

j=t+l 


(16) 


The rescaled-range statistic Q measures the range of partial sums of deviations 
of the time series {Wi, W 2 , •..} from its mean, rescaled by an estimate of its 
standard deviation [44] , 


A.3 The Hurst Exponent 

The following theorem by Mandelbrot m provides a relationship between a 
time series’ long-range autocorrelations and its rescaled-range statistic. 

Theorem A.l. If {Wt} is a second-order stationary proeess such that Wf is 
ergodic and t~^ X)i=i converges weakly to a fractional Brownian 
with parameter H as t > 00 , then 

k~^Q{t, k) ^ Tj as k ^ 00 , (17) 


where rj is a nondegenerate random variable and denotes convergence in 
distribution. 


The constant H is called the Hurst exponent of {Wt} P IMl 1501ICT1^ . 
A time series with a Hurst exponent oi H = 1/2 is a short-memory process. 
For a long-memory process that satishes the conditions of this theorem, H is 
related to a in Equation (12) by 



(18) 


and to /3 in Equation (35) by 


H = 


/? + ! 

2 


(19) 


fractional Brownian motion |48| is a Gaussian process Bu{t) with 0 drift that satisfies 
Sh( 0) = 0 and E [B„ {t)B h[s)] = 1 -\t- sp") for some H e (0,1). 
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B Empirical Assessment of Long Memory 

In many empirical situations, it is common to observe only a single, finite- 
length realization {wi,W 2 , ■ ■ ■ ,wn} of {Wt}. If the statistical properties of 
{Wt} are unknown, then estimating the long-memory properties of {Wt\ from 
{wi,W 2 , ■ ■ ■, wn} entails considerable challenges [5Jini|S3]. Most empirical stud¬ 
ies employ heuristic methods for this task. The performance of such techniques 
on empirically observed series varies considerably, so it is common for empirical 
studies to evaluate the output of several heuristic methods rather than relying 
on a single estimator. In this appendix, we provide a detailed description of 
the techniques that we use throughout the paper. For further discussion and 
comparisons of these techniques, see [S5]. 


B.l Sample Autocorrelation Function 

For an empirically observed time series {wi,W 2 , ■ ■ ■, wn}, let 


w= ( 20 ) 

' i=l 

denote the sample mean, let 

^ 7V-|fc| 

= ]v ^ {wi+\k\ - w) - w) (21) 

2=1 


denote the sample autocovariance function, and let 


p{k) = 


7(fc) 

7 ( 0 ) 


( 22 ) 


denote the sample ACF. It is very difficult to estimate the large-fc decay of p{k) 
from p, so direct estimation of the long-memory properties of {Wt\ from 7 often 
produces very poor results mi. 


B.2 Rescaled-Range Plots 


For a given block number S S N, let 


G(k) = 


r 


+ I|i = l,...,i3; t + k < N 


( 23 ) 


A rescaled-range plot (also known as a pox plot) [IHl [M] is a plot of 


R{k) 


1 

Wm 


E C(iA) 

teG{k) 


( 24 ) 


versus k on doubly logarithmic axes, where \G{k)\ denotes the number of ele¬ 
ments in G{k). The slope of a rescaled-range plot for large values of k provides 
a rough estimate for the Hurst exponent H m- 


28 





B.3 Lo’s Modified Rescaled-Range Statistic 

Lo [44] noted that if a time series {Wt} is subject to short-range autocorrela¬ 
tions, then the denominator S{t,k) of the rescaled-range statistic Q{t,k) is not 
a consistent estimator for the standard deviation of {Wt}. Therefore, an im¬ 
portant difficulty in using the rescaled-range statistic to assess the long-memory 
properties of an empirically observed time series {rci, W 2 , ■ ■ ■, wn} is that the 
finite-sample properties of Q are not invariant to short-range dependence. To 
address this problem, Lo proposed replacing the denominator of Q with the 
Newey-Wes10 estimator which discounts short-range dependence in {Wt} 
up to a specified lag q < N. The parameter q is called the bandwidth parameter. 
Lo’s modified rescaled-range statistic (Hj is 


Qiq) 


RjhN) 


(25) 


where 




iiq = 0, 

5^(1, TV) + 2j2Ui (l - 7(*)> otherwise. 


Given Q{q), the statistic 


V{q) = 


Q{q) 

y/N 


can be used as a test statistic for the hypothesis test 


(26) 


(27) 


Hq '. {Wt} is a short-memory process, 

Hi '. {Wt} is a long-memory process. 

This hypothesis test is called Lo’s modified rescaled-range test |44] . In the limit 
N —>■ oo, the asymptotic critical region for the test at the 5% significance level 
is approximately [0.809,1.862]. 

Teverovsky et al. [64] remarked that although Lo’s modified rescaled-range 
statistic is a significant improvement over the original rescaled-range statistic, 
Lo’s test can fail to reject Hq for some time series with long memory. Moreover, 
they noted that both the size and the power of the test depend on q. The optimal 
choice of q for Q{q) and V{q) depends on the behaviour of the spectral density 
/ of {Wt} P. If / is unknown (as is usually the case for empirically observed 
series), then there is no universal rule via which to choose q. In empirical 
applications, it is therefore customary to calculate Q{q) and V{q) using several 
different choices of q and/or to calculate a so-called plug-in estimator q [Tll^Hi] 
by assuming that / is equal to the spectral density of a specified parametric 
process. Andrews ^ derived plug-in estimators for several different parametric 
processes (including autoregressive, moving-average, and ARMA models). 

®For a suitable choice of q, Newey and West El showed that o-(q) is a consistent estimator 
for the standard deviation of {Wt}, even if {Wt} is subject to short-range autocorrelations. 
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For our calculations, we use the plug-in estimator for an AR(1) process 


Wt = cj)Wt-i + et, 


(28) 


where (() € K is the autocorrelation parameter and St is uncorrelated Gaussian 
noise. This estimator is given by [mm 


q = 




(29) 


where (j) is the maximum-likelihood estimate of cj) given {wi,W 2 , ■ ■ ■ ^wn} and 
[ccj denotes the greatest integer less than or equal to x. 


B.4 Detrended Fluctuation Analysis 

Detrended fluctuation analysis (DFA) |58j is a technique for estimating the Hurst 
exponent from an empirically observed series {wi,W 2 , ■ ■ ■ ,wn}- Let 

i 

z = l,2,...,7V. (30) 

For a given window length m G N such that m < N, divide {ic*, rcj, • ■ •, 
into non-overlapping windows of length m. For each j S {1,2,..., \_N/m \}, label 
the data points in window j as yiy, ■ • • jUmj, perform an ordinary least- 
squares regression to fit a straight line to the m data points in the window, 
and let yij denote the value of the regression line at the point for each 
i G {1,2,..., to}. For each window, calculate the detrended standard deviation 


Fj{m) = 


\ 


ivij yi,j) I 


i=l 


and then calculate the length-m mean detrended standard deviation 

\_N /m\ 


F{m) = 


1 


[7V/toJ 




(31) 


(32) 


i=i 


Repeat this process for several logarithmically spaced choices of window length 
TO, and plot F(rn) versus to using doubly logarithmic axes. Identify the smallest 
value TOmin such that the plot is approximately straight for all to > TOmin- The 
DFA estimate of FI is given by the slope of the best-fit line for to > TOmin. 


B.5 Log-Periodogram Regression 

The long-memory properties of {Wt} can also be characterized by the behaviour 
of its spectral density [5] 

^ OO 

= ^ E 7(fc)e-*'=" (33) 

k——oo 
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in the limit A —>■ 0. If there exists a constant I G M such that 


/(A) —Z as A —>■ 0, 


(34) 


then {Wt} is a short-memory process. If, by contrast, there exists a constant 
/3 G (0,1) such that 

/(A) ~ A"^ as A ^ 0, (35) 

then {IFt} is a long-memory process. Larger values of j3 correspond to slower 
decay of the long-range autocorrelations in {W^} [6j. 

Estimating the behaviour of /(A) close to 0 provides an alternative approach 
to estimating the Hurst exponent H. Let 


j G 1, 2,..., 


iV- 1 


denote the Fourier frequencies of {wi,W 2 , ■ ■ ■ ,u'Ar}, and let 

2 


H^3,n) = 


1 


2TrN 


N 




(36) 


(37) 


denote the corresponding periodogram. The slope of an ordinary least-squares 
regression of log (/(Aj_Ar)) onto log {Xj,N) for small is an estimator for —(3, 
and it is therefore an estimator for H laiiiiiM]- 

Despite the attractiveness of its computational simplicity, log-periodogram 
regression suffers from a substantial practical drawback [6] : there is no universal 
rule for choosing the number c of Fourier frequencies with which to perform the 
regression. The scaling in Equations (34) and (35) only holds for A —>■ 0, so 
choosing an overly large c leads to large bias, but choosing an overly small c 
leads to high variance. Robinson [60| derived an expression for the optimal 
choice of c to minimize the mean squared error of the estimated cumulative 
spectral distribution function, but the optimal choice turns out to depend on 
the unknown value of H. Therefore, Robinson’s expression does not provide a 
method for choosing c for an empirically observed time series. Instead, most 
empirical studies use one of two rules of thumb: c = '/N [2^ or c = 0.1 x (iV/2) 
[62]. Despite their widespread use, neither of these rules are based on rigorous 
derivation or optimization. 


C Long Memory versus Nonstationarity 

A key difficulty with assessing the long-memory properties of empirically ob¬ 
served time series is that many estimation techniques can produce similar out¬ 
put for a nonstationary series (e.g., a series with a monotonic trend |S] or change 
in mean [HIISI]) as they do for a stationary series with long memory ISni El HD- 
Several authors have thus argued that the apparent long memory reported by 
empirical studies of financial time series is an artifact caused by nonstationarities 

UlllliSllle]. 
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Disentangling the statistical properties of a time series with long memory 
and a time series with nonstationarities is a difficult task, and the choice of 
whether to model such time series using a long-memory model or a nonsta¬ 
tionary model often depends on the desired application. Long-memory models 
are parsimonious, straightforward to simulate, and there exist many techniques 
that require only mild assumptions to estimate their parameters from data [5]. 
Nonstationary models can illuminate important features of an underlying se¬ 
ries (such as the locations and frequency of structural breaks) that are not 
addressed by long-memory models, but they typically require the inclusion of 
either a large number of parameters (which can lead to over-fitting) or latent 
parameters (which can be difficult to estimate from data). 

Many standard tests for nonstationarities in an empirically observed series 
have low power in the presence of long memory |211134j , and there is no universal 
test that is able to determine whether the apparent long-memory properties of 
an empirically observed time series are an artifact caused by some unknown 
form of nonstationarity m- However, Berkes et al. [J developed a hypothesis 
test that can help to distinguish between long memory and a specihc type of 
nonstationarity that they called structural breaks. 

Definition. The time series {Zt} is a short-range dependent series with a 
structural break at time r* if there exists a real-valued, second-order stationary, 
short-memory process {Ct} and a constant /i* 0 such that 


I Ct, t<r*, 

\yL*+Cu t>r*. 


(38) 


Given a finite-length empirical observation {zi, Z 2 , ■ ■ ■, Zff} of a time series 
{Zt}, Berkes’ change-point test is the hypothesis test 

Hq '■ {Zt} is a short-memory process with one structural break 
Hi : {Zt} is a long-memory process. 


Berkes’ test uses the so-called cumulative-sum change-point estimator [7] 


r = mm < r : max 

1<3<N 


For a given q < N, lei 


N 




N 




(39) 


y(l) = 


y(2) _ 


max 




a-f^){qWN -f* f*<i<N 


N 

E 


max 


N 


I — r 
jV - f* 

j^r 


E 
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where (T(i)(g) and ^{ 2 ){q) are the values of d{q) from Equation (26) for the 
{zi,Z 2 ,..., Zf*} and {zf+i, Zf*+ 2 , ■ ■ ■, zn} series, respectively. The test statistic 
for Berkes’ test is 

(40) 


M = max|r(i\r(2)|. 


In the limit N —>■ oo, the asymptotic critical value of M at the 1% significance 
level is about 1.72 [7]. 
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